Abstract. A classical theorem of S. Bochner states that a function $f : \mathbf{R}^n \to \mathbf{C}$
is the Fourier transform of a finite Borel measure if and only if $f$ is positive
definite. In 1938, I. Schoenberg found a beautiful complement to Bochner's
theorem. We present a non-technical derivation of Schoenberg's theorem
that relies chiefly on the de Finetti theorem and the law of large numbers of
classical probability theory.



1. Introduction 

A real-valued function $g$ of $n$-vectors is said to be positive semi-definite (sometimes,
positive definite) if $\sum_{i,j=1}^k g(x_i - x_j) c_i \overline{c_j} \ge 0$ for all $n$-vectors $x_1, \ldots, x_k$
and all complex numbers $c_1, \ldots, c_k$.
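
The definition above can be probed numerically. The following is a minimal sketch, not part of the original argument: it evaluates the quadratic form $\sum_{i,j} g(x_i - x_j) c_i \overline{c_j}$ at randomly drawn points and coefficients. The helper names (`gaussian_kernel`, `quadratic_form`, `min_quadratic_form`) and the choice $g(x) = \exp(-\|x\|_n^2/2)$ are illustrative assumptions.

```python
import math
import random


def gaussian_kernel(x):
    """g(x) = exp(-||x||^2 / 2); illustrative choice of a positive semi-definite g."""
    return math.exp(-0.5 * sum(v * v for v in x))


def quadratic_form(g, points, coeffs):
    """Return sum_{i,j} g(x_i - x_j) * c_i * conj(c_j) as a complex number."""
    total = 0j
    for xi, ci in zip(points, coeffs):
        for xj, cj in zip(points, coeffs):
            diff = [a - b for a, b in zip(xi, xj)]
            total += g(diff) * ci * cj.conjugate()
    return total


def min_quadratic_form(g, dim=3, k=6, trials=200, seed=0):
    """Smallest real part of the quadratic form over random points and coefficients."""
    rng = random.Random(seed)
    smallest = float("inf")
    for _ in range(trials):
        points = [[rng.gauss(0, 1) for _ in range(dim)] for _ in range(k)]
        coeffs = [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(k)]
        smallest = min(smallest, quadratic_form(g, points, coeffs).real)
    return smallest


if __name__ == "__main__":
    # Non-negative up to rounding error for the Gaussian kernel.
    print(min_quadratic_form(gaussian_kernel))
```

A random search of this kind can only refute positive semi-definiteness, never prove it; it is meant as a sanity check on the definition.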

A classical theorem of S. Bochner (1955, Theorem 3.2.3, p. 58) asserts that
positive semi-definite functions are precisely those that are Fourier transforms of
finite measures. Let $\|\cdot\|_n$ denote the usual Euclidean norm in $n$ dimensions. That
is, $\|x\|_n = (x_1^2 + \cdots + x_n^2)^{1/2}$ for all $x \in \mathbf{R}^n$. The goal of this note is to present
a very simple proof of the following well-known theorem of I. J. Schoenberg (1938,
Theorem 2):

Schoenberg's Theorem. Suppose $f : \mathbf{R}_+ \to \mathbf{R}_+$ is continuous. Then, the
following are equivalent:

(1) For every $n \ge 1$, the function $\mathbf{R}^n \ni x \mapsto f(\|x\|_n)$ is positive semi-definite.

(2) The function $\mathbf{R}_+ \ni t \mapsto f(\sqrt{t})$ is the Laplace transform of a finite Borel
measure on $\mathbf{R}_+$.
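
To make assertion (2) concrete, consider the illustrative choice $f(t) = e^{-t}$ (an assumption made for this example only). Then $f(\sqrt{t}) = e^{-\sqrt{t}}$, which is the Laplace transform of the probability density $\rho(s) = s^{-3/2} e^{-1/(4s)} / (2\sqrt{\pi})$ on $\mathbf{R}_+$. The sketch below checks this identity numerically with a plain trapezoid rule after the substitution $s = e^u$; the function name and the integration window are hypothetical choices.

```python
import math


def levy_density_laplace(t, lo=-6.0, hi=40.0, step=0.01):
    """Trapezoid-rule value of int_0^inf e^{-t s} rho(s) ds, where
    rho(s) = s^{-3/2} e^{-1/(4 s)} / (2 sqrt(pi)), using s = e^u."""

    def integrand(u):
        # rho(e^u) * e^{-t e^u} * e^u, combined into a single exponent
        exponent = -0.5 * u - t * math.exp(u) - 0.25 * math.exp(-u)
        return math.exp(exponent) / (2.0 * math.sqrt(math.pi))

    n = int(round((hi - lo) / step))
    total = 0.5 * (integrand(lo) + integrand(hi))
    for k in range(1, n):
        total += integrand(lo + k * step)
    return total * step


if __name__ == "__main__":
    for t in (0.5, 1.0, 4.0):
        # The two printed columns should agree closely.
        print(t, levy_density_laplace(t), math.exp(-math.sqrt(t)))
```

This matches the theorem in this one case: $x \mapsto e^{-\|x\|_n}$ is positive semi-definite in every dimension, and $f(\sqrt{t})$ is a Laplace transform.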

Originally, this theorem was used to describe isometric embeddings of Hilbert
spaces. Since its discovery, it has also found non-trivial connections to other diverse
areas ranging from classical, as well as abstract, harmonic analysis (Berg and Ressel, 1978;
Berg et al., 1984; Kahane, 1985) to the measure theory of Banach spaces (Bretagnolle et al.,
1965, 1965/1966, 1967; Christensen and Ressel, 1982; Koldobsky, 1996;
Misiewicz, 1996a, 1996b; Koldobsky, 1999; Koldobsky and Lonke, 1999), function
theory (Ressel, 1974; Diaconis and Freedman, 2004a, 2004b) and to the foundations
of statistics via de Finetti-type theorems (Freedman, 1963; Ressel, 1985;
Diaconis and Freedman, 2004a, 2004b). For other relations, in particular, to
statistical mechanics, see the detailed historical section of Diaconis and Freedman (2004b).

Although Schoenberg's original proof is not too difficult to follow, it is somewhat
technical. P. Ressel (1976) has devised a simpler proof which rests on a characterization
of Laplace transforms (Ressel, 1974, Satz 1) that is similar to Schoenberg's
theorem. We are aware also of another simple proof, due to J. Bretagnolle, D.
Dacunha-Castelle, and J.-L. Krivine (1965, 1965/1966, 1967). Their proof is similar
to the one presented here, but is slightly more technical.

The present article aims to describe a self-contained, elementary, and brief deriva- 
tion of Schoenberg's theorem. Our proof assumes only a brief acquaintance with 
real analysis and measure-theoretic probability theory. This proof is quite robust 
and can be used to produce more general results; all one needs is a more general 
setting in which a basic form of the de Finetti theorem and the law of large numbers 
hold. 

Since writing the first draft of this paper, we have found out about the work of
D. Kelker (1970, Theorem 10). Kelker's proof is essentially the same as ours. J.
Kingman (1972) contains yet another rediscovery of Kelker's proof.

Acknowledgements. Christian Berg brought to my attention the recent work of
Steerneman and Perlo-ten Kleij (2005), and Paul Ressel made an important correction
to the original draft. I am deeply indebted to them both.

2. The Proof 

All notation and references to probability theory are standard and can be found 
in any standard first-year graduate textbook. 

Without loss of generality, we may suppose that $f(0) = 1$. Then, thanks to
Bochner's theorem, Schoenberg's theorem translates to the equivalence of the
following two assertions:

(1°) For all $n \ge 1$ there exists a Borel probability measure $\mu_n$ on $\mathbf{R}^n$ such that

(2.1) $f(\|x\|_n) = \int_{\mathbf{R}^n} e^{i x \cdot y} \, \mu_n(dy) \qquad \forall x := (x_1, \ldots, x_n) \in \mathbf{R}^n.$

(2°) There exists a Borel probability measure $\nu$ on $\mathbf{R}_+$ such that

(2.2) $f(t) = \int_0^\infty e^{-t^2 s/2} \, \nu(ds) \qquad \forall t \ge 0.$

Therefore, it suffices to prove that (1°) and (2°) are equivalent. The assertion
"(2°) $\Rightarrow$ (1°)" follows from a direct computation because $x \mapsto \exp(-\|x\|_n^2 s/2)$ is
manifestly a Fourier transform on $\mathbf{R}^n$. So we prove only the converse. Henceforth,
we assume that (1°) holds.

Our next lemma follows immediately from (1°) and the uniqueness theorem. 



Lemma 1. The family $\{\mu_n\}_{n=1}^\infty$ is consistent.



It might help to recall that "$\{\mu_n\}_{n=1}^\infty$ is consistent" means that for all $n \ge 1$ and
all linear Borel sets $A_1, A_2, \ldots, A_n$, $\mu_n(A_1 \times \cdots \times A_n) = \mu_{n+1}(A_1 \times \cdots \times A_n \times \mathbf{R})$.



Proof of Schoenberg's Theorem. In accord with Lemma 1 and the Kolmogorov consistency
theorem, there exists an exchangeable stochastic process $\{Y_k\}_{k=1}^\infty$, on some
probability space $(\Omega, \mathrm{P})$, such that for all $n \ge 1$ and all Borel sets $A \subset \mathbf{R}^n$,

(2.3) $\mathrm{P}\{(Y_1, \ldots, Y_n) \in A\} = \mu_n(A).$

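Choose and fix some $t \ge 0$, and introduce a sequence $\{X_i\}_{i=1}^\infty$ of independent
random variables such that every $X_i$ has the normal distribution with mean zero
and variance $t^2$. We can assume, without loss of generality, that the $X_i$'s are
defined on the same probability space $(\Omega, \mathrm{P})$. We first apply (1°) with
$x := n^{-1/2}(X_1, \ldots, X_n)$, and then take expectations, to deduce that for all $n \ge 1$,

(2.4) $\mathrm{E}\left[ f\left( \sqrt{\frac{1}{n} \sum_{i=1}^n X_i^2} \right) \right] = \int_{\mathbf{R}^n} \exp\left( -\frac{t^2 \|y\|_n^2}{2n} \right) \mu_n(dy) = \mathrm{E}\left[ \exp\left( -\frac{t^2}{2n} \sum_{i=1}^n Y_i^2 \right) \right].$
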
See (2.3) for the last identity. Now let $n \to \infty$. The simplest form of the law of
large numbers dictates that $\sum_{i=1}^n X_i^2 / n \to \mathrm{Var}\, X_1 = t^2$ in probability. Therefore,
the left-hand side of (2.4) converges to $f(t)$ by the dominated convergence theorem.
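
The law-of-large-numbers step can be illustrated by simulation. The following is a rough Monte Carlo sketch, under the assumption $f(r) = e^{-r}$ (chosen only for illustration), of the quantity $\mathrm{E}[f((n^{-1} \sum_{i=1}^n X_i^2)^{1/2})]$ for independent $X_i$ that are normal with mean zero and variance $t^2$. The function name `lhs_estimate` and its parameters are hypothetical.

```python
import math
import random


def lhs_estimate(f, t, n, reps=200, seed=1):
    """Monte Carlo estimate of E[f(sqrt((X_1^2 + ... + X_n^2) / n))]
    for independent X_i ~ N(0, t^2)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(reps):
        mean_square = sum(rng.gauss(0.0, t) ** 2 for _ in range(n)) / n
        total += f(math.sqrt(mean_square))
    return total / reps


if __name__ == "__main__":
    f = lambda r: math.exp(-r)  # illustrative choice of f
    t = 1.5
    for n in (10, 100, 2000):
        # The estimates should approach f(t) as n grows.
        print(n, lhs_estimate(f, t, n), f(t))
```

For large $n$ the estimate should settle near $f(t)$, in line with the convergence claimed above; the simulation is only a plausibility check and plays no role in the proof.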

By the de Finetti theorem, the $Y_i$'s are conditionally i.i.d. given the exchangeable
$\sigma$-algebra generated by the $Y_i$'s. Thanks to the Kolmogorov strong law of
large numbers, and by the Fubini theorem, $L := \lim_{n \to \infty} \frac{1}{n} \sum_{i=1}^n Y_i^2$ exists a.s.
Moreover, the event $\{L < \infty\}$ agrees up to null sets with $\{\mathrm{E}[Y_1^2 \mid \mathcal{E}] < \infty\}$, where
$\mathcal{E}$ denotes the exchangeable $\sigma$-algebra of $\{Y_i\}_{i=1}^\infty$. By the dominated convergence
theorem, the right-hand side of (2.4) converges to $\mathrm{E}[\exp(-t^2 L/2); L < \infty]$.

We have proved that $f(t) = \mathrm{E}[\exp(-t^2 L/2); L < \infty]$ for a possibly-degenerate
non-negative random variable $L$. Set $t = 0$ to find that $L$ is a proper random
variable; i.e., $1 = f(0) = \mathrm{P}\{L < \infty\}$. Therefore, (2°) follows with $\nu$ denoting the
distribution of $L$. □
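
As a sanity check on the argument, here is a worked example under the assumption $f(t) = e^{-t^2/2}$ (our illustrative choice, not taken from the text). In this case $\mu_n$ is the standard normal law on $\mathbf{R}^n$, the $Y_i$'s are i.i.d. standard normal, and both expectations compared in the proof can be computed in closed form from $\mathrm{E}[e^{-\lambda Z^2}] = (1 + 2\lambda\sigma^2)^{-1/2}$ for $Z$ normal with mean zero and variance $\sigma^2$.

```latex
% Worked example, assuming f(t) = e^{-t^2/2}; then \mu_n = N(0, I_n) and Y_i are i.i.d. N(0,1).
\begin{align*}
\mathrm{E}\Bigl[ f\Bigl( \sqrt{\tfrac{1}{n} \textstyle\sum_{i=1}^n X_i^2} \Bigr) \Bigr]
  &= \prod_{i=1}^n \mathrm{E}\bigl[ e^{-X_i^2/(2n)} \bigr]
   = \Bigl( 1 + \frac{t^2}{n} \Bigr)^{-n/2}, \\
\mathrm{E}\Bigl[ \exp\Bigl( -\frac{t^2}{2n} \sum_{i=1}^n Y_i^2 \Bigr) \Bigr]
  &= \prod_{i=1}^n \mathrm{E}\bigl[ e^{-t^2 Y_i^2/(2n)} \bigr]
   = \Bigl( 1 + \frac{t^2}{n} \Bigr)^{-n/2}, \\
\lim_{n \to \infty} \Bigl( 1 + \frac{t^2}{n} \Bigr)^{-n/2}
  &= e^{-t^2/2} = f(t) = \int_0^\infty e^{-t^2 s/2} \, \delta_1(ds).
\end{align*}
```

Here $L = \lim_{n \to \infty} n^{-1} \sum_{i=1}^n Y_i^2 = 1$ almost surely, so $\nu$ is the point mass $\delta_1$ at $s = 1$, in agreement with (2°).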